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Abstract 

The sequence starts with o(l) = 1; to extend it one writes the sequence so far as XY^, where 
X and Y are strings of integers, Y is nonempty and k is as large as possible: then the next 
term is k. The sequence begins 1, 1, 2, 1, 1, 2, 2, 2, 3, 1, 1, 2, 1, 1, 2, 2, 2, 3, 2, ... A 4 
appears for the first time at position 220, but a 5 does not appear until about position 10^*^^^ . The 
main result of the paper is a proof that the sequence is unbounded. We also present results from 
extensive numerical investigations of the sequence and of certain derived sequences, culminating 
with a heuristic argument that t (for t = 5,6,...) appears for the first time at about position 
2 I (2 I (3 1 (4 I (5 1 ... t ((i — 2) | (i — 1)))))), where j denotes exponentiation. The final section 
discusses generalizations. 

1. Introduction 

This paper introduces an integer sequence A = a(l), a{2), a(3), . . . with some remarkable properties. 
Define the curling number C{U) of a string U = u{l), u{2), . . . , u(n) over some alphabet Q to be 
the largest integer k > 1 such that 

U = X YY . ^..Y^ = XY^ , (1) 
k copies 

where X and Y are strings over Q and Y is nonempty. Our sequence is defined by 

a(l) = 1, a(n+l) =C(a(l),...,a(n)) for n>l. (2) 

Then a(2) = C(l) = 1, since we can only take X to be the empty string 0, y = 1 and k = 1; 
a(3) = C(l, 1) = 2, by taking X = Y = 1, k = 2; a(4) = C(l, 1,2) = 1, by taking X = 1,1, 
Y = 2, k = 1 (as this example shows, there may be more than one choice for Y); and so on. The 
first 220 terms of A are shown in Tables 1 and 2. 

To avoid any possible confusion, for example with the "Say What You See" sequence studied 
in [3], we emphasize that the curling number does not depend on the decimal representation of its 
arguments. For example, if [/ = (8, 9, 10, 11, 11, 11), C{U) = 3. 

In Section 2 we describe the recursive structure of the sequence, in particular explaining the 
block structure visible in Tables 1 and 2. The proof that this structure is valid is postponed to 
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1 1 2 

1 1 2 2 2 3 

1 1 2 

1 1 2 2 2 3 2 

1 1 2 

1 1 2 2 2 3 

1 1 2 

1122232223222332 

1 1 2 

1 1 2 2 2 3 

1 1 2 

1 1 2 2 2 3 2 

1 1 2 

1 1 2 2 2 3 

1 1 2 

11222322232223322232 

Table 1: The first 98 terms of the sequence. In the notation to be introduced in Section 2, the 

five underhned strings are the glue strings s\^^ , S^^ , • • • , and the five bold-face strings are 

7^(1) 7^(1) ^(1) 
-^2 J ^3 '■■■'-'6 • 

Section 3, where we give the main results of the paper, Theorems 4 and 5. Corollary 7 shows that 
the sequence is unbounded. 

In Section 4 we give empirical estimates for the lengths of the blocks in the recursive structure, 
culminating in the estimate that t > 5 appears in the sequence for the first time at about position 

t-i 

2'" , (3) 

a tower of height t — 1. These estimates are based on examination of the first two million terms of 
the sequence A and of the higher-order sequences A^'^\ A^^^ and A^^'^ introduced in Section 2. 

The final section is devoted to comments and generalizations. §5.1 discusses a certain plausible 
"Finiteness Conjecture" that arises from studying curling numbers. §5.2 discusses sequences that 
are obtained when the "curling number transform" (defined below) is applied to certain well-known 
sequences. Finally, §5.3 briefly mentions some generalizations of our sequence, including a broad 
class of extensions suggested by J. Taylor [9]. 

Although the sequence A grows very slowly, there are certainly familiar sequences with an even 
slower growth rate, such as the inverse Ackermann function [1], the Davenport-Schinzel sequences 
[7], or the inverse to Harvey Friedman's sequence [4]. Nevertheless, we think the combination 
of slow growth, an unusual definition, and a remarkable recursive structure makes the sequence 
noteworthy. 

The sequence was invented by one of us (D.C.G.) while composing problems for the Dutch 
magazine Pythagoras. It now appears as sequence A90822 in [8]. 

Notation 

If is a set, il" denotes the strings of length n from 17, J^"*" is the set of all nonempty finite strings 
from fi, and fi* is the set of all finite or infinite strings from 0, including the empty string 0. 
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1 1 2 

1 1 2 2 2 3 
1 1 2 

1 1 2 2 2 3 2 
1 1 2 

1 1 2 2 2 3 
1 1 2 

1122232223222332 
1 1 2 

1 1 2 2 2 3 
1 1 2 

1 1 2 2 2 3 2 
1 1 2 

1 1 2 2 2 3 
1 1 2 

1122232223222332223222322233 
2223222322233334 



Table 2: Terms 99 through 220 of the sequence, up to the point where the first 4 appears (S'g is 
underhned, T^^^ is shown in bold-face). 

Strings wih usually be denoted by uppercase letters. The elements of a string may or may not be 
separated by commas, and a string may or may not be enclosed in parentheses. A sequence is an 
infinite string. The length of U G fi* (which may be oo) will be denoted by 1{U). 

Products in ^1* represent concatenation: if [/ G V then UV means U followed by 

V. We will usually not concatenate two infinite strings. A string U = u{l), . . . ,u{i) is said to be 
a substring of V = v{l), . . . ,v{j) if there is an r, < r < j — z, such that u{k) = v{k + r) for 
k = 1, . . . ,i; that is, if the elements of U occur consecutively in V. We say V contains U to indicate 
that U is a, substring of V. Terms such as prefix, suffix, etc., have their usual meanings — see [2] for 
formal definitions. A sequence U is said to be a subsequence of a sequence V if U can be obtained 
by deleting terms from V. 

Usually 0, will be either the nonnegative integers N = {0,1,2,3,...}, the positive integers 
P = {1, 2, 3, . . .}, or the set = m + 1, m + 2, . . .} for some integer m > 1. 

Given a sequence U = u{l),u{2), ... G f]*, its curling number transform is the sequence C/^= = 
tt*(l), u*(2), . . . G P* given by ii*(l) = 1 and 

-u^(i) =C('u(l),...,n(i-l)) for i>2. (4) 

It is immediate from the definition (2) that our sequence A is equal to its curling number transform, 
and in fact is the unique sequence with this property. 

2. The recursive structure 

We introduce the notation in three stages: informally, more formally and — in Section 3 — with a 
somewhat different emphasis that will be needed to prove the main theorems. 
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Informally, the sequence A is built up recursively from "blocks" Bn'' that are doubled at each 
step and are joined together by "glue" strings Sn^ . When the glue strings alone are concatenated 
together they form a sequence A^'^^ which has a similar structure to A: it is built up recursively 

(2) 

from blocks Bn that are repeated three times at each step and are joined together by "second- 

(2) 

order glue" strings Sn ■ When the second-order glue strings are concatenated together they form a 
sequence A^^^ which in turn has a similar structure, but now the blocks Bn^ are repeated four-fold 
at each step; and so on. The proof that this description is correct will be given in the next section. 

We now make this description more precise. The following description is correct, and is the best 
way to think about the sequence. However, we will not know for certain that it is correct until the 
end of Section 3. 

The sequence A is constructed from strings Bn^ and Sn \ n > 1, which we call "blocks" and 
"glue," respectively. The initial block is ^ = 1; the second block is B^^^ = 5^ ^ = 112, 
where S^^ = 2; the third block is 

Bi'^ = Bi'^Bi'^si'^ 

= 1 1 2 1 1 2 2 2 3, 

where 5*2^^ = 2 2 3, and so on, the n-th block for n > 2 being 

^(1) _ ^(1) ^(1) (K) 

where contains no I's. Then for all n > 1, A begins with Bn"^ (and hence A = lim B^^). 

n— >oo 

That is, for all n > 2, A begins with two copies of bI^}_^ followed by a "glue" string sl^}_^ that 
contains no I's. sl^}_i is terminated by the first 1 that follows the initial B^}_-^^bI^}_^. Table 1 shows 
b[^^ through B^^^ (the first row is -62^^ the first two rows together form B^\ . . ., and the whole 
table forms B^^), and Tables 1 and 2 together form B^^^ The glue strings s[^\ . . . , S^^^ are 
underlined. By iterating (5) we see that Bn^ can also be written as 

B^^^ = Bl^l,Bi:i,...B['^B[^^S^^^S^\..S^:i,. (6) 

The terminating string S'j^'^S'g^'* . . . sl^}_^ (denoted by TrP in Section 3) is shown in bold- face in 
Tables 1 and 2 for n = 2, . . . , 7. 

In Section 4 we state some conjectures about the lengths of the blocks Bn^ and of the glue 
strings -S^^i^. Assuming these conjectures are correct, /(S^^^^i) is much less than 1{B^}_^), and 
consequently l(Bn^) is roug hly twice l{B^n-i)- 

The above decomposition reduces the study of A to the study of the glue strings Sn'^ . We 
define the "second-order sequence" A^"^^ = a^^^(l), a^'^\2), a^^)(3),... to be the concatenation 
(S*^^^ 52^^53"^^ . . . oi the glue strings. It will be shown later that A^'^^ can also be defined by 

a(2)(l) = 2, 

a(2)(n + l) =C(2)(a(2)(l),a(2)(2),...,a(2)(n)) for n>l, (7) 

where we define 

C('")(?7) = max{m,C([/)} (8) 

for m > 1. That is, if C{U) = k \s less than m it is "promoted" to m (we will say more about 
"promotion" at the end of Section 3). Of course c(i) = C. 
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2 2 2 3 
2 2 2 3 
2 2 2 3 3 
2 2 2 3 
2 2 2 3 
2 2 2 3 3 
2 2 2 3 
2 2 2 3 

22233334 

2 2 2 3 
2 2 2 3 
2 2 2 3 3 
2 2 2 3 
2 2 2 3 
2 2 2 3 3 
2 2 2 3 
2 2 2 3 

22233334 

2 2 2 3 

2 2 2 3 

2 2 2 3 3 

2 2 2 3 

2 2 2 3 

2 2 2 3 3 

2 2 2 3 

2 2 2 3 

222333343 

Table 3: The first 127 terms of the second-order sequence 

^(2) (the successive second-order glue 
strings S^r^\ S^^^ are underlined; the strings T2^\t^'^\t^^\t^'^^ are shown in bold- face). 

The first 127 terms of A^"^^ are shown in Table 3, and the reader can verify that they may indeed 
be obtained by starting with 2 and repeatedly applying the map C^^^ . 

It is remarkable that A^'^^ has a similar structure to A, only now the blocks are repeated three 
times. That is, if we define ^ = 2, then for all n > 2, yl^^) 

gins with a block 

d(2) _ o(2) o(2) p(2) 0(2) /QN 
(2) (2) 

consisting of three copies of followed by a "second-order glue" string G P3 that contains 

(2) (2) (2) 

no I's or 2's and is terminated by the first number less than 3 that follows the initial B]^_'^B^_^B^_^. 
Table 3 shows uf^ (as well as sf'^ through S^-* ) . The glue strings sf'^ , , <S'3^'* , S^^^ are under- 

f2") f2") (2) f2) f2) 

lined. Bn ends with the string S{ S2 . ■ ■ S)^_i (denoted by Tn in Section 3); these strings are 
shown in bold- face in Table 3 for n = 2, . . . , 5. 

Again we have a conjectured estimate (see Section 4) for the lengths of the glue, which implies 
that l{Bii ) is roughly three times 1{B\_^). 

This analysis reduces the study of A^"^^ to the study of the second-order glue strings Sn'^ , and 
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these, when concatenated, form the third-order sequence A^^^ , which in turn has a similar structure. 
And so on! 



3. The main theorems 

In this section we estabhsh our main theorems, which wih show that the description of the sequence 
given in Section 2 is correct. To do this we must introduce our notation very carefuhy. The 
fohowing definitions (temporarily) supersede those in Section 2. For m > 1, the mth-order sequence 
^(m) ^ a('")(l),a(™)(2),a(™)(3), ... € P^;, is defined by 

a(™)(l) = m, 

a("^)(i + l) = C(™)(a(™)(l),...,a(™)(i)) for i>l, (10) 

where C^™) is defined in (8). Note that A^^) is our sequence A. Theorem 4 will show that ^("^+1) 
is the concatenation of the glue strings for A^"^^ . 

For m > 1, n > 1, the blocks B^'^ € and the glue strings 5^™^ € ^m+i defined 
recursively, and independently of the A^^\ Corollary 7 will show that all the strings -Bi"^\ sIT'^ 
and Tn"^^ (defined below) are in fact finite, but at this point we do not know that, and the definitions 
must allow for the possibility that some of these strings may be infinite. 

The recursion for the blocks is 

B["'^=m, (11) 

and, for n > 1, 

„M _ / (i?i™))-+i5^) if liB^rT^) < oo , 

^n+l-|^M if/(i?M) = oO, ^ ^ 

where S^^ wiU be constructed from B^"^ . If /(^i™^) = oo, 5^™^ = for i > n. If /(si^^) < oo, 
consider the sequence Sn "^(1), s^)(2), s^)(3), ...eFl, defined by 

s(r^(l) = C("^)((S^'"))™+^), 

s(r)(z+i) = c(™)((i?(-))™+i.(r)(i)...s(r)(i)) for i>i. (13) 

Clearly s^\l) > m + 1. If there is an integer i > 1 such that Sn"^(z + 1) < m + 1, choose the 
smallest such i, and set 

= .(r)(i), 4'")(2), . . . , 4-)(z) G P++1 , (14) 

but if no such i exists set 

S(-)=4™)(l),s(r)(2),...GP;,+i. (15) 

In the latter case S^rT^ and B^' '^-^ are infinite. 

The Tir^ are defined as follows. For n > 1, if . . . , sii^^ are finite we set 

ra = 5i-)...5r)€P++,, (16) 

while if ^{'"^ . . . , 5^1 ^\ are finite but S'i'"^ is infinite, we still use (16) and define 
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for i > 71 + 2. In the latter case all the T^^™^ for i > n + 1 are infinite. Note that T^^™^ is always 
undefined. 

The lengths of these strings (which may be infinite) are denoted by 

= ^(4™)), (18) 

C7(™)(n) = ^(5^^")), (19) 

rM(n) = l{Tjr^). (20) 



We also let = 6('")(1), 6('")(2), 6('")(3), . . . = lim„^oo ^i"^ • This is weU defined since each 

Bir^ starts with bI^\. 

We will require three lemmas. 

Lemma 1. For m > 1, if A^"^^ contains a string U^~^^ G for some t > m, then U € P^. 

Proof. If t = m the claim is trivially true, so we may assume t > m + 1. Suppose, on the contrary, 
that U ^ P^. Then we may write U = GiH for G, H ^ P,^ and some i with m < i < t — 1. Thus 
A contains GiH GiH . . . GiH {t + 1 copies). But the final i is preceded by t copies of iHG, so the 
final i must be at least t, by definition of A^"^^ , a contradiction. ■ 

Lemma 2. For m > 1, n > 2, (a) Tn""^ is a suffix of B^\ and (b) this is the only occurrence of 
Tn"^^ as a substring o/i?i™\ 

Proof. Fix m > 1. It follows by iterating (12) that 

= (i?("^))"^(i?S)™.--(i?i™))"^i?J'")Ti™J, (21) 

provided all of s'^'^ S^"^^ . . . S^^^ are finite. If sj'^^sj"'^ . . . s';^\ are finite but sj^'^ is infinite, (21) 
is still true, but 

^(m) ^ ^ ^(™) ^ for i > „ + 2 . (22) 

Assertion (a) follows at once. To show (b) we use induction on n. The base case, n = 2, is true 
because T2™'^ = m + 1 and B^^^ = {m"^~^^,m + 1). If is infinite and has two occurrences in 
B^i , they are both suffixes of -B^'^i , implying that T^+j is a suffix of itself, and hence is a periodic 

sequence. But this is impossible: let Mq be the maximal element of T^\. After sufficiently many 
terms the curling number given by (13) would produce a term greater exceeding Mq, a contradiction. 
On the other hand, suppose that all the are finite. If also occurs in B^^^ other than as 
a suffix, it must be a substring of a block Bj^^ in (21), for some j with 2 < j < n, for otherwise it 
would contain the m at the beginning of a bbck. Write B^""^ = UT^^^lv = UT^'^'^ sf^ ■ ■ ■ S^T^V 

for some U e F+ and V e F^. But /(sj""^ • • • si*"^) > 0, so t]™^ occurs as a non-suffix in B^"'\ a 
contradiction to the induction hypothesis. ■ 

Remark. It follows from the above proof that, for any r with 1 < r < m + 1, any finite substring 
{B^^Y in (21) contains exactly r copies of each one occurring at the end of a B^\ The 

copies are disjoint. 
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Lemma 3. For m > 1, n > 2, suppose that k = >m + l with 1 <i< p'^"^\n). Then there 

exists a Y such that b^"^\l), . . . ,b^^\i — 1) = XY''. Moreover, let Y satisfy this condition with 
l(Y) minimal and suppose m GY. Then Y = Bj^^ for some j with 1 < j < n — 1. 

Proof. We fix 77T, > 1, and will prove the result for all n by induction. The base case n = 2 is 
immediate, since B^^ = (jn™"''^, m + 1). Supposing the result holds for some 7i > 2, we will show 
it holds for n+ 1. If i?^™^ is infinite then the result also holds for n+ 1, by (12), so we may assume 
that Si"'^ is finite. Then bI^\ = (si'^^)("^+i)5i'"\ by (12). We must show that the result holds 
for all positions /5(™)(n) < i < fj'^'^\n + 1). 

If z is a position in (i?!™^)"^"*"^, we may write i = r(3^''^\n) + s < (m + l)/9^™'^(n), for 1 < r < m, 
1 < s < /3("')(n). Then 5('^)(i) = 6('")(s) and by induction we know that in the first .bI™'^ we can 
write b^"^\l), . . . , b^'^\s — 1) = XY^, and if the minimal Y contains an m then it equals Bj"^^ for 
some j with 1 < j < n — 1. Therefore this Y (and no shorter string) can also be used at position 
i, and thus the statement holds. 

If i = (m + l)/3('")(n) + 1 then the part preceding i is (S^"*^)"*"''^, and from (13) we have 

k = 6(™)(i) = 4"^)(1) = C^"'\{B^^^r+^) >m + l. 

So certainly one Y exists with ^(''"^(l), . . . , b^^^\i — 1) = XY^ . We must show that if the minimal 
Y satisfying this property contains an m, then that Y = i^j'"^ for some j < n. IfY contains an m, 
then it contains T^"^^ as a substring, since the last m in bI^^ occurs before the Tn™^ in the last 
copy of B^^ . Therefore the string Y^ contains at least k > m + 1 copies of T^"^^ . It follows from 
the Remark below Lemma 2 that k = m + 1 and Y = . 

If z > {m + l)/3('")(n) + 1 we see by the definition of S*!™^ that again a Y exists. If Y contains 

an m, then it must properly contain the Tn"^^ in the final copy of But in the last of the k 

copies of Y the copy of is followed by an integer larger than m, whereas in the earlier k — 1 
copies it was followed by the first element of -6^"*^ which is m. This is a contradiction, and shows 
that in this case Y cannot contain an m. ■ 

Note that, by definition of sl^^, the Y^ for the first element of Sn^^ goes back further than 

b'^\ and thus contains an m. Therefore we see that the situation described in the penultimate 
paragraph of the above proof is indeed the case and we may conclude that 

sM(i) = m + 1 for ah m > 1, n > 1 . (23) 

At this point we can already see that the concatenation of the glue strings is equal to the next 
A sequence: 

Theorem 4. Suppose m > 1. For all n > 2, T^™^ is a prefix of A("'+^\ or is all o/ ^("^+1) ifT^T^ 
is infinite. 

Proof. Again we fix m and use induction on n. For n = 2 the result is trivial. Supposing the 
result holds for some n > 2, we will show it holds for n + 1. If T^™^ is infinite then clearly the 
result holds for T^\, so assume that Tn"^^ is finite. 

Write (Si'"^)™+i = f/ri™^ for some U e P+. We know that si^^ begins with m + 1 = si"'^(l) = 
C('^)(C/ri'"^) = C("^+^)(C/ri™^) = C("^+i)(Ti"'^). The last equality holds because dropping the U 
can only decrease the value, but it is already equal to its minimal value of m + 1. By the induction 
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hypothesis, r„ is a prefix of ^ and therefore T„ s„ (1) is a prefix of A^"^^^\ For i > 1, 

as long as si™^ (i) > m + 1 , we have 

.(r)(i + i) = c(-)(c/r(-).(r)(i)...4-)(z)) 

= c(-+^)(T(-)4-)(i)...4-)(i)). 

The second equahty holds because s^\i) > m + 1. The third equality holds because 

Sn^\i + 1) goes back no further than the beginning of as we saw in the proof of the previous 

lemma. Hence T^^i = T^'^ S^'^ is a prefix of ^4^'""'"^^ as required. ■ 

Theorem 5. For all m > 1, the sequences A^"^^ and coincide. 

Proof. Fix m> 1. We will show by induction on n that, for all n > 1, is a prefix of A^"^^ or 
is all of A^"^^ if is infinite. This will establish the theorem. 

The cases 7i = 1 and n = 2 are immediate, since B^^ = m, b!^'^ = {m'^^^,m + 1). So assume 
the truth of the induction hypothesis up to and including some n > 2. 

If B^i^^ is infinite the result follows from (12), so we may assume that Bn^^ and hence Tn"^^ are 
finite. We wish to show that B^^^ = {B^^)"^~^^ sii^^ is a prefix of A^^'\ If this is not true, the 
first discrepancy between B^^^ and occurs in the substring (5^'"^)™'+^, by the definition of 

Sl^"\ Let i > /3(™)(n) + 1 be the first position in at which a(™)(i) / b^"^\i). Our goal 

is to show that the existence of i leads to a contradiction. 

We may write i = (n) + r with 1 < j < m and 1 < r < fi^"^\n). Then i is also minimal 

with respect to the condition that a(™)(i) ^ a'^"'\r). Let «('")(!), . . . ,a('")(i - 1) = XY^ with k 
maximal and 1{Y) minimal. Then a^'^^i) = max{m, A;}. 

We consider two cases, depending on whether or not a^"^^ (i) is at the beginning of one of the 
Bn^^ blocks, i.e. whether r = 1 or r > 2. 

First, suppose r = 1; then we need to prove that a^"^\i) = a^™^(l) = m. This follows by 
definition of S*^™! if j = 1, so assume j > 2, and that k = a^"^\i) > m + 1. Using (21) we may write 
a("^) (1), . . . , {i-l) = {B^^y = {B^'^y^^UmTjr'' for some U £F*^.lf Tr^^^ is a proper suffix 
of then m € Y, which implies that T^l™^ is a proper suffix of Y and therefore (B^^y contains 
at least m + 1 copies of Tn"^\ contradicting the Remark following Lemma 2. On the other hand, if 
Y^ were a suffix of this would contradict the fact that is followed by an element < m. 

Second, suppose that r > 2. Let L = a^"^\l), . . . ,a^'^\r — 1) and write L = X^X^* with fc* 
maximal and /(I"*) minimal. Then a^™^(r) =max{?7i, A;^,}. By the definition of i, 
m. Hence a^^\i) = k > m+ 1. To have a^"^\i) > a(™)(r), L must be a suffix of Y^, so m gY and 
therefore, by Lemma 1, /c is at most m + 1 and therefore is equal to m + 1. Hence fe* < m. 

The situation, then, is that {B'^'^y L is a prefix of A^'^\ We are supposing that we can achieve 
a{m){i) = m + 1 by allowing L to be a suffix of y'"+^ Noting that T^™^ is a suffix of {B^^^^y, by 
(21), we distinguish two cases, depending on the relationship between Tn^^ L and y™+i. 

(i) Suppose that L is a suffix of "We know m G y and m so contains 

at least m disjoint copies of T^"^\ Hence j = m, and there are exactly m disjoint copies, by the 
Remark following Lemma 2. This means that each copy of T^'^ straddles the end of one copy of 
Y and the beginning of the next (if not, T^"^^ is wholly contained in Y , and so there are m + 1 
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copies of Tn"^^ in the sequence before position i, which is a contradiction since there are only m 
copies, one in each of the m copies of si'"^ and none so far in the next copy of that we 

are building), and hence that y is a proper suffix of tIT^L. Write T^™^ = VW where W is the 
intersection of T^™^ and the last (or {m + l)-st) copy of Y, and write = UT^\ using (21). 
If m > 2 it is easy to complete the proof. We have Y = WL = WUV, so L = UV and therefore 
i > /(y"*+i) = (m + l)l{WUV) = {m + 1)1{UVW) = (m + l)/?("')(n), contradicting the definition 
of i. 

Suppose then that m = 1. Again L is a proper suffix of Y and y is a proper suffix of T^f^ L. 
Write y = WL, and let s > 2 be the first element of Y . Let this element s in the second copy of 
y be preceded by s copies of some string Y' with 1{Y') minimal. 

Suppose that Y' does not contain a 1. Since Y does contain a 1 (L starts with a 1), Y'^ is a 
suffix of the first copy of y , and hence also of the second copy of Y . This contradicts the minimality 
of 1{Y), since then 1{Y') < 1{Y). 

So we may assume that 1 E y', hence by Lemma 3 we know that 

Y' = 

for some k < n. T^^^ is a suffix of Y^ and since L starts with a 1, T^"^^ is also a suffix of L. By 
Lemma 2, bI^^ is also a suffix of L. Suppose Bj^^ = L. Then WL is a suffix of Y'Y' = LL (look at 
the first copy of y = WL and remember Y'Y' begins with a 1) and hence is a suffix of L. But 
then W"^ is a suffix of Ti^\ contradicting the fact that L starts with a 1. 

So we may assume that Bk is a strict suffix of L. But now l(Y) >1{L)> 2l{B^n^). (Indeed, if 
1{L) < 21{B^}^), then we know that L is a prefix of Bn \ by definition, bI^'^ B^^^ is also a prefix of 
Bn \ and so L is a strict prefix of B^'^ B^'^; but L has suffix, so by Lemma 2, L = B^\ 

a contradiction.) But now is a suffix of y, contradicting the minimality of Y . 

(ii) Suppose on the other hand that y™-+i is a suffix of Tn"^^ L. Since no Y is contained in T^™^ 
(remember that m €Y), Y"^ is a suffix of L and the first element, t, of y is in T^™^ with t>m+l. 
Therefore the first element of the second Y is also t and since (i?n™^)-'L is a prefix of A^"^\ Y 
ends with [/* for some U. Hence f/* is a suffix of L, which contradicts the fact that /c* = m. This 
completes the proof. ■ 

Corollary 6. The sequence contains every integer > m. 

Proof. From Theorem 4 we know that, for m > 2, n > 2, is a prefix of A^'^\ so, for a 

given m, either 

4(m.) _ Q(m-l) ^(m-l) ^-.(m-l) 

-/I — \_ 9 * * ■ 71 

if some SjY^ is infinite, or 

A{m) _ Q(m-l) g(m-l) g(m-l) 
J\ — ^1 '-'2 '-'3 

if all S^r^'^^ are finite. Also, by Theorem 5, is a prefix of A^"^ ^\ so from (12), if some 

(Sn™ is infinite, A^™"^) contains 

Q{m~l) Q(m-l) Q(m-l) 
0| ,02 5 • • • ) ' 

or if ah are finite, A^"^ contains S"^™ ^\ Sg™ . . . , S"!™ for all n. In either case (and 

this is the key point), every prefix of A^™) is a subsequence of A^'^~^\ Repeating this argument 
shows that every prefix of every is a subsequence of ^('"^ if j > m. 
Since A^^^ begins with j, ^4^"^^ contains every integer j > m. ■ 
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Corollary 7. The strings Bn , Sn and Tn have finite length. 

Proof. The first occurrence of an integer in A^™^ is necessarily followed by an m. Since we saw in 
the previous corollary that A^"^^ contains infinitely many different integers, it follows that all Sn"^^ 
are finite. This implies that -B^™"^ and T^™"^ are also finite. ■ 



Promotion 

In the definition of A^'"), (10), let us say that a^"^\i) is promoted if either i = 1 or 

< m. If we know which elements in A^"^^^' are promoted, we can recover 
^(■m) fj-Qj^ ^(m+i)^ make this precise, we define the strings D^"^^ € by Dq™^ = m and, for 
i > 0, 

(^) f DSa("^+^)(i) if a(™+i)(«) is not promoted, 

~' if a(™+i)(i) is promoted. ^^^^ 

Since D^"^^ starts with D^^\, we can define the limiting sequence 

^(m) ^ Yirai^^D'f\ Then it 

can be shown that: 

Theorem 8. For all m > 1, the sequences A^™'^ and coincide. 

We omit the proof, which involves arguments similar to those used to prove Theorems 4 and 
5. The main difference is that this proof does not require the finiteness of the glue strings sli^\ 
Furthermore, the glue strings now by definition unite to form the next A-sequence, but on the other 
hand it becomes more difficult to show that they are indeed substrings of A^"^^ itself. 



4. Estimates for the rate of growth 

In this section we take an experimental approach, and record a series of observations about the 
sequence. These observations appear to be correct, but we have been unable to prove them. In 
§4.1 we study the lengths of the glue strings sli^\ Although these lengths are somewhat irregular, 
it appears that they can be "smoothed" so as to become much more regular "ruler" sequences, 
whose peak values will be denoted by p^"^\n). In §4.2 we describe a "tabular" construction for the 
higher-order sequences A^'^\ A^^\ . . . which leads to a recurrence relating the p^"^\n), f3^"^\n) and 
Sections 4.3, 4.4 and 4.5 contain estimates for P^"^\n), and T^"^\n). Finally, in 

§4.6, we use these estimates to determine where each number t > 1 appears for the first time in 
our sequence A. 



4.1. Ruler sequences and smoothing 

It appears that the sequence cr^'") = a^"^\l) , a^"^\2) , a^^\3) , . . . giving the lengths of the glue 
strings S'n"^^ is essentially a "ruler" sequence, in the sense that cj^'"-' (n) essentially depends only on 
the (m + l)-adic valuation of n. 

For positive integers m, n, define the m-adic valuation of 

|^|m; to be the highest power of vn 
dividing n. The classical example of a ruler sequence is the sequence r = r(l), r(2), r(3), . . . given 

by 

r(n) = |n|2 + l. (25) 
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The first 32 terms are 



1213121412131215 
1213121412131216 

where the new record entries, shown in bold-face, occur at powers of 2. For much more about this 
sequence, including an extensive bibliography, see entry A1511 in [8]. 

The initial values of (t^^\ ■ ■ ■ , cr^^^ are shown in Table 4, and the record entries in cr^^\ . . . , a^^^^ 



I 5. Let 


7r(™) 


(j) (j 


> 


0) denote 


the 


j-th record 


in 
















n 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


15 


16 




1 


3 


1 


9 


4 


24 


1 


3 


1 


9 


4 


67 


1 


3 


1 


9 


a(2)(n) 


1 


1 


3 


1 


1 


3 


1 


1 


9 


1 


1 


3 


1 


1 


3 


1 


£7(3) (n) 


1 


1 


1 


3 


1 


1 


1 


3 


1 


1 


1 


3 


1 


1 


1 


10 


£7(4) (n) 


1 


1 


1 


1 


3 


1 


1 


1 


1 


3 


1 


1 


1 


1 


3 


1 


n 


17 


18 


19 


20 


21 


22 


23 


24 


25 


26 


27 


28 


29 


30 


31 


32 


£7(1) (n) 


4 


24 


1 


3 


1 


9 


4 


196 


3 


1 


9 


4 


24 


1 


3 


1 


£7(2) (n) 


1 


9 


1 


1 


3 


1 


1 


3 


1 


1 


32 


1 


3 


1 


1 


3 


£7(3) (n) 


1 


1 


1 


3 


1 


1 


1 


3 


1 


1 


1 


3 


1 


1 


1 


10 


£7(4) (n) 


1 


1 


1 


11 


1 


1 


1 


1 


3 


1 


1 


1 


1 


3 


1 


1 


n 


33 


34 


35 


36 


37 


38 


39 


40 


41 


42 


43 


44 


45 


46 


47 


48 


£7(1) (n) 


9 


4 


68 


3 


1 


9 


4 


24 


1 


3 


1 


9 


4 


581 


3 


1 



Table 4: Values of a^^\n) for n < 48 and a^'^\n), a^^\n), a^^\n), for n < 32, with record entries 
shown in bold-face. 



m\j 





1 


2 


3 


4 


5 


6 


7 


8 


9 


1 


1 


3 


9 


24 


67 


196 


581 


1731 


5180 


15534 


2 


1 


3 


9 


32 


119 


463 


1837 


7332 


29307 


117203 


3 


1 


3 


10 


42 


200 


983 


4892 


24434 


122141 




4 


1 


3 


11 


55 


315 


1872 


11205 


67195 






5 


1 


3 


12 


70 


471 


3273 


22883 








6 


1 


3 


13 


87 


673 


5355 


42805 








7 


1 


3 


14 


106 


927 


8309 


74740 








8 


1 


3 


15 


127 


1239 


12351 


123463 








9 


1 


3 


16 


150 


1615 


17721 










10 


1 


3 


17 


175 


2061 


24683 











Table 5: Values of vr("*)(j), the j-th record in sequence a^"^\ The smoothed record values p^"^^j) 
are obtained by reducing the italicized entries by 1. The next three terms in the first row are 46578 , 
139713 , 419116, and the next term in the m = 2 row is 468785. The missing entries in this table 
have not been calculated, although we predict that the entries on or below the diagonal m = j are 
given by (40) and the entries just above this diagonal by (41). 
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As can be seen from Table 4, is not quite as regular as the ruler sequence r. However: 

Observation 9. // the sequence a^^^ is "smoothed" by replacing every instance of 4 by the pair of 
numbers 3, 1, every 9 by 8, 1, every 25 by 24, 1, and so on, a^^^ becomes a ruler sequence r^^^ given 
by 

r«(n) =p«(|n|2), (26) 

in which the first 64 terms are 

1318131 24 1318131 67 

1 3 1 8 1 3 1 24 1 3 1 8 1 3 1 195 

1 3 1 8 1 3 1 24 1 3 1 8 1 3 1 67 

1 3 1 8 1 3 1 24 1 3 1 8 1 3 1 580 

and where the record values (shown in bold-face) p^^\0) , p^^\l) , . . . are 

1, 3, 8, 24, 67, 195, 580, 1730, 5179, 15533, 46578, 139712, 419115, .... (27) 

The numbers i in a^^^ that are to be replaced by i — 1, 1 to get r^^^ are 

4, 9, 25, 68, 196, 581, 1731, 5180, 15534, 46579, 139713, 419116, . . . , (28) 

The numbers that need to be smoothed, given in (28), are one greater than the numbers in 
(27), except that 2 is missing. The records in the smoothed sequence r^^\ (27), either agree with 
or are one less than the terms in the first row of Table 5. 

The sequences a^"^^ for m > 2 appear to need less smoothing than a^^^ to make them into ruler 
sequences. In the range of our tables, a^"^^ needs to be smoothed by replacing every 32 by 31, 1, 
and every 7332 by 7331, 1; a^^^ by replacing every 200 by 199, 1; a^^^ by replacing every 1872 by 
1871, 1; and so on. If r^"^^ denotes the smoothed version of cr^™^ and /0^™Hj) the j-th. record in the 
smoothed version (see Table 5) then we have, for all m > 1, n > 1, 

rM(n) = /,(™)(|nU+i). (29) 

The lengths /^("^^(n) of the blocks are given by (from (12), (18)) 

/3(™)(1) = 1, 

/3("^) {n + 1) = (m + l)/3(™) (n) + a^™) (n) for n > 1 . (30) 
The initial values of P^^\n), . . . , P^^\n) are shown in Table 6. 



m\n 


1 


2 


3 


4 


5 


6 


7 


8 


1 


1 


3 


9 


19 


47 


98 


220 


441 


2 


1 


4 


13 


42 


127 


382 


1149 


3448 


3 


1 


5 


21 


85 


343 


1373 


5493 


21973 


4 


1 


6 


31 


156 


781 


3908 


19541 


97706 


5 


1 


7 


43 


259 


1555 


9331 


55989 


335935 


6 


1 


8 


57 


400 


2801 


19608 


137257 


960802 



Table 6: Lengths /3(™)(n) of the blocks B, 
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4.2. The tabular construction 



The appearance of ruler sequences can be partially explained if we present the construction of the 
higher-order sequences in a tabular format. In this construction we keep track not 

only of the actual value A^^\n) = max{m, k} (cf. (10)) but also whether the promotion rule was 
invoked (if /c < m we indicate this by drawing a circle around the entry) and the length of the 
shortest Y that was used to compute k if k > m (shown as a subscript; if the promotion rule was 
invoked the subscript is 0). This tabular construction will also suggest a recurrence that relates 
p("^)(n + 1), /3('"+i)(n + 1) and a^'^+^^n + 1). 

We will construct A^"^^ as an example. We start by making a small table of the glue strings Sn^ 
for n < 10 — see Table 7. (We aheady saw s[^\ . . . , sf^ in Table 3.) 



n 


o{2) 


1 


3 


2 


3 


3 


3 3 4 


4 


3 


5 


3 


6 


3 3 4 


7 


3 


8 


3 


9 


334333344 


10 


3 



(2) 

Table 7: The first few glue strings Sn ■ 
We know from Section 3 that A^^) = lim M^) = hm T^^^ = s\^^ si^K . . and that B^^L = 

n^oo n^oo 

Table 8 shows the beginning of the construction of A^'^^ . 
The aim is to understand how A^^^ breaks into the consecutive Sn^ glue strings for A'^^K To 
do this, a version of A^"^^ is produced in which terms that are obtained by promotion are circled, 
and where the subscript on each term is either for a circled term or else gives the length of the 
shortest Y that can be used to compute that term. The circled terms will be the first terms of each 
of the glue strings Sn^ of A^^^ . Most of the circling and subscripting work is done by a few simple 
rules. However, the rules occasionally give the wrong answer and a few corrections may need to be 
made by hand at the end of each round. It is the presence of these adjustments that makes our 
sequence hard to analyze. 

(2) (2) (2) 

We start with = @q. The rules for going from i?„ to B^^^^ are as follows: 

(2) 

(i) Write B n as a single string, and construct a three-rowed array in which each row is a copy of 

(2) . . (2) 

Bn , omitting all circles from the third row. This three-rowed array (after Sn is appended in step 
(iii)) will form B^^^ when read as a single string. (When constructing A*^"*) we make m copies of 
bIi^^ and omit the circles from the m-th copy.) 

(ii) The subscripts in rows 2 and 3 are the same as in row 1, except that terms in row 3 that are 

(2) 

under circled terms in row 2 have their subscripts changed to l{Bn )■ 

(2) (2) (2) 

(iii) Append 5„, to the end of row 3. The first term of Sn receives the subscript l{Bn )■ The 
subscripts on the remaining terms of Sn must be computed separately — they can be obtained 
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from the tabular construction of A^"^^^^ . 

(iv) Finally, a few circles in row 2 may need to be omitted and their subscripts recomputed, as well 
as the subscripts on the same terms in row 3. 

(2) (2) (2) 

In Table 8, rules (i)-(iii) give the correct answers for B2 and -B3 . But in four terms 
(marked with asterisks in Table 8) must be corrected. The first entry in row 2 of is (2)o. 
However, row 1 ends with 3 3 = S^^ S^^ = Y"^, with a y of length 1, so that 2 did not need to be 



(2) 

promoted and we must change (2)o to 2i. The fifth entry in row 2 of -B4 is But it is preceded 
by 



32223 32223 = S'f ^ S^^^ 5^ ^ sj^^ = Y"^ , 
with a y of length 5, so we must change (2)o to 25. The corresponding entries in row 3, presently 



equal to 2 13 


, also 


get chanj 


;ed to 2i 


and 25 respectively. 










®Q 




















®0 




















@0 




















2i 


3i 
















bP = 


@0 


(2)0 


2i 


3i 














(2)0 


(2)0 


2i 


3i 














24 


24 


2i 


3i 


34 










Bf^ = 


(2)0 


(2)0 


2i 


3i 


(2)o 


(2)o 2i 3i 24 24 


2i 


3i 


34 




®*o 


(2)0 


2i 


3i 


®o 


@o 2i 3i 24 24 


2i 


3i 


34 




^13 


2l3 


2i 


3i 


0* 

^13 


2i3 2i 3i 24 24 


2i 


3i 


34 



Table 8: Tabular construction of A^'^\ 
(2) 

When we extend Table 8 to Bj^^ , we find that in all only ten circles need to be removed. After 

(2) (2) (2) (2) 

Bl , the next changes are at i?^ , where two circles get removed because of the splittings 

= 3 3 = Y^, with a y of length 1, and sf^Bf^sf^B^ = Y^ with a y of length 128. But not 
all instances of such splittings cause circles in the table to be removed, and not all circle-removals 
arise in this way. It seems difficult to explain exactly where corrections to the table are required. 

However, the corrections are rare, and still fewer corrections are needed for larger values of m. 

Since A^"^^ is also lim Tjj^\ we can read off the lengths of the glue strings Sn^ from the table. 

n— »oo 

(2) 

Look at the lengths of the strings (in B^ ) between one circle and the next: these are 1, 3, 1, 9, 4, 24, 
. . .. exactly the values of cj(i)(l), cj(^)(2), . . . (cf. Table 4). If we do not make the corrections needed 
in step (iv), we instead get the smoothed lengths 1, 3, 1, 8, 1, 3, 1, 24, .... These observations 

(2) 

lead to our conjectured recurrence. For example, note that the string in B^ ' from the last circled 

i(^)|'.3')'l anrl is madp iin nf the last strincr in rI^'^ 



entry to the end has length 24 (which is '{3)) and is made up of the last string in Bj^ (length 

(length 13, which is /3(2)(3)), plus sf 



8, which is p^^\2)) plus the whole of Bf^ (length 13, which is /3(^)(3)), plus sf^ (length 3, which 



is C7"(^^(3)). More generally, we have: 
Observation 10. For m > 1, 

p(™)(0) = 1, 



P 



(m) 



{n + 1) = (n) + /3('"+^) (n + 1) + a^'^+'^^ {n+l) forn>Q. (31) 
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This recurrence is supported by all the data, although we do not have a proof. 



4.3. Estimates for the lengths (3^^\n) of the blocks 

In this section we first prove formulas (33), (34), (35), which give the exact vlaue of P^"^\n) in the 
parabolic region 1 < n < (m + 1)^ — 1 for m > 1. We then give conjectural estimates for /3(™)(n) 
(indicated by ~), (36) and (37), which apply for all m and n. 

We take (30) as our starting point. For m > 1, a^'^\n) and r(™)(n) coincide for 1 < n < 
(m + 1)^ — 1, and in that range are given by 

^{m)(._(m)(._j^ if m + 1 does not divide n , 

a [n)-r W-jg if ^ + i divides n . ^"^^^ 
By iterating (30) and using (32) we find: 
Lemma 11. For m > 1 and 1 < n < (m + 1)^ — 1, 

m [m + — 1 

where v € {0, 1, . . . , m} is given by n — 1 = v mod m + 1. 

In particular, for l<n<m+lwe have v = n — 1 and so 

/3(-)(n)= ("^ + ^^"-\ (34) 
m 

and for m + 2 < n < 2m + 2 we have v = n — m — 2 and 

. ) ^ (m + 1)" + 2{m + 1)—^ - 2(m + 1)"-^ - 1 _ 

m 

Equation (34) explains the entries on or below the diagonal n = m + 1 in Table 6, (35) explains 
the entries in the "wedge" m + 2 < n < 2m + 2, and (33) the entries in the whole parabolic region 
bounded by n < (m + 1)^ — 1. 

The right-hand side of (33) is also a good approximation to P^"^\n) for fixed m > 2 and 
n > (m + 1)^. The case m = 1 is special, because of the greater differences between a^"^^{n) and 
r^"^\n) when m = 1. However, P^^\n) is well-approximated by 

/3W(n) ^ ei2"-i, (36) 

where ei = 3.48669886 . . .. 

For our applications, the approximation 

/3('^)(n) ^ em{m + 1)"-^ for m > l,n > 1 (37) 

(consistent with (33)-(36)) will be adequate, where Em is a constant on the order of 1. 
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4.4. Estimates for the records p^'^h 



n] 



We now apply (31) to estimate p^™) (n). Except for (38), the formulas in this section are conjectm'al. 
Again using (32), we find that 



m{n + l + 2u)+ /3(™+i) (n + 1) 
m + 1 



pM (n) = -n-^^^^^'^y^M- ^^^^ ^^^^ 



for < n < (m + 2)^ - 1, where u = [n/(m + 2)J. Eliminating /3(™+^)(n + 1) from (31) and (38) 
we obtain 

(n + 1) = (m + 2)/)("*) (n) + (n + 1) - m(?i + 1 + 2n) . (39) 

In particular, for 0<n<m + l, n = and so (38) gives 



[m + 1)^ 



while for m + 2 < n < 2m + 3, n = 1 and 

(m)r ,_ {^ + 2)"+^ + 2{m + l)(m + 2)"-"^-^ + (n + 3)m(m + 1) - 1 
^ (m + l)2 • ^^^^ 

Equation (40) matches the smoothed values on or below the diagonal n = m + 1 in Table 5, (41) 
matches the values in the "wedge" m + 2 < n < 2m + 3, and (38) matches the values in the whole 
region bounded by the "parabola" n < {m + 2)^ — 1. Equation (41) is in fact a good estimate of 
pi'^)[n) for all m and n. 

The greatest differences between the exact values 7T^"^\n) and the smoothed values p(™)(n) 
occur in the first row of Table 5. The ratio of terms TT^'^\n + l)/TT^'^\n) in row m of that table 
rapidly approaches m + 2, and for fixed m we find that 

vr(™)(n) ^ A™(m + 2)-, (42) 

where approximate values of Am are 

m : 1 2 3 4 5 6 7 ••• 
A„ : .778 .447 .312 .240 .194 .163 .140 ••• 



Curve-fitting suggests that 
which we approximate by 
leading to 



.956m + 2.11 
(m + 1)2 

m + 2 



(m + 1)2 ' 



(43) 



vrM(n)«i!^^±^ (44) 
^ ^ (m + l)2 ^^^^ 

for m fixed and n large. 

Since the leading terms in (38), (40) and (41) agree with (44), we will take (44) as our approx- 
imation to both TT^"^\n) and p^'^\n) for all m and n. 
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4.5. An estimate for T^'^\n) 
From equations (16), (20) we have 

n 

r(")(n + l) = ^cT('")(i), n>l. (45) 

i=l 

To simplify the analysis (we are only seeking a crude estimate in this section) we suppose we 
have reached the end of block si™^ in A^"^\ where n = (m + 1)^' + 1 for some /U > 1. This block 
ends with the string Tn'"'* of length r^'")(n). 

Of the (m + 1)^ strings 1 < i < {m + l)'^, that appear in Tn"^\ a fraction ^^^-j- have 

= and contribute vr^^Vo) to the sum; a fraction , "^-.-.^ have = 1 and contribute 

TT^"^\l); and so on. Therefore, from (45), 

((^ + 1)M + 1) = + 1)M ^ ^^^M (,) + ^ 

i=0 

where the last term accounts for the final glue string S^^^. Using (44) this becomes 

r^"^) ((m + 1)^ + 1) w ^^^-^ ((m + 2^ - m(m + l)^"^) . (46) 
m + 1 V / 

We summarize the discussion in the last two sections in the following: 

Observation 12. After smoothing (cf. Observation 9), the sequence a^"^^ = a^"^\l) , a^"^\2) , . . . 
of the lengths of the glue strings in yl(™) is a ruler sequence given by (29), where the record values 
are given by (38) (exact, for n < (m + 2)^ — 1) and (44) (approximate, for all m and n). Equation 
(46) gives an estimate for T^"^\n). 



4.6. The first occurrence of t 



We use the above estimates to determine where a number t > 1 appears for the first time in A. We 
already know from Tables 1 and 2 that a 1 appears at position 1, a 2 at position 3, a 3 at position 
9, and a 4 at position 220, so we may assume t > 5. 

For fixed t, let x{m) be the position where t appears for the first time in the sequence A^'^\ for 
1 < m < t. We will successively estimate x{t), x{t — 1), . . . ,x(l), working backwards from 

A^ ^ = t, . . . , t , t -j- 1, . . . , 
t+i copies 

where t appears as the leading term, and t + 1 appears for the first time at position t + 2. A more 
detailed analysis of the beginning of A^^^ , omitted here, shows that t + 2 appears for the first time 
at position 

it+iy^'+2t-i ^^^^ 

For example, 3 appears in A at position 9, 4 in A^'^^ at position 42, and 5 in A^^^ at position 343. 
Thus x{t) = 1, xlt -l) = t + l, and 

,(,_2) = a^ill±^. (48) 
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We first consider the case t = 5. Since x{3) = 343, a 5 appears in A^"^^ for tlie first time at the 
end of block B^"^^ , wliere i is such that T(2)(i) = 343. That is, i is determined (see (45) and Table 4) 
by the equation 

r(2)(i) = 1 + 1 + 3 + 1 + 1 + 3 + 1 + 1 + 9 + ... = 343. 

^ V ' 

i-i terms 

By direct calculation, i = 80, and again by direct calculation from (30), a 5 appears in 

^(2) 

at 

position 

x{2) = /?(2)(80) = 77709404388415370160829246932345692180, 

which is lO^^'^'". So 5 appears in A = A^^"^ at the end of block where i is such that T^^\i) = 
x{2). Setting jtt, = 1 in (46) we get 

.a.p. + i).|,3.(i-i@")..(2,, 

hence fi = 79.0 . . ., i = 2''^'^"\ Setting m = 1, n = 2'^^'^''' in (36) we finally obtain 

x(l) = /3W(2^9.o...)^^^22^-... 

for the position of the first 5. 

Consider now a general value of t > 6. To find x{t — 3), we must solve (from (48)) 

,(t-3)., (t-l)* + 2t-5 
^ ^ t-2 

Setting i = {t — 2)^ + 1 and using (46) we get 

t-'^fu u o^u oNM-A (t-l)* + 2t-5 



((i-l)'^-(t-3)(t- 2)^-1) 



t-2V ' ^ " ' ) t-2 
hence /i ~ t — 1, and so 

x{t - 3) = P^^\{t - 2)^ + 1) = esit - 2)(*-2)*"' . 

The £3 may be ignored, since it can be absorbed into the tower of exponentials. The next iteration 
gives 

^(i_4) = (t-3)(*-3)^^-^'"' 

and eventually we obtain 

t-i 

xil) = 2^'' , (49) 
a tower of height t — 1. We formalize this as our final 

Observation 13. The number t > 5 appears for the first time in the sequence A at about position 
(49). 

Remark. The position of the first 5 can be estimated more accurately using the tabular construc- 
tion of Section 4.2, thus avoiding the simplifying assumptions made in Section 4.5. This analysis 
predicts that the first 5 will appear at about position 

where £1 is given in §4.3. We omit the details. 
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5. Comments and generalizations 

5.1. The Finiteness Conjecture 

The proof of Theorem 5 would have been simpler if we had known in advance that the glue strings 
Sn™^ were finite. This would follow from the following: 

Finiteness Conjecture. For integers m > 2 and r >1, let x{l),x{2), . . . ,x{r) be a string from 
P^. Let x{n + 1) = C{x{\),x{2), . . . , x{n)) for n > r. Then for some n > r + 1, x{n) < m. 

In other words, there is no finite starting string from which extends under repeated appli- 
cation of the map C to an infinite sequence from P^. Sooner or later a term less than m must 
appear. 

Although this conjecture seems very plausible, we have not been able to find a proof. If one 
tries to construct a starting string which extends for a long time without dropping below m one 
quickly runs into difficulties. Let mi and m2 be respectively the smallest and largest values in the 
starting string. Then no number outside the range [mi,m2] — or in fact any number not in the 
starting string — may appear in the resulting sequence, for such a number is immediately followed 
by a 1, terminating the sequence. So if the sequence is infinite it must be bounded. 

As an experiment we considered all 2" starting strings of length n < 30 consisting just of 2's 
and 3's, and calculated the lengths of the resulting strings until just before the appearance of the 
first 1. The maximum and average lengths are shown in Table 9. The average length seems to 
approach n-\- constant, but the maximum length is harder to understand, and it would be nice to 
have more data. Does the maximum length continue to grow linearly, or are there further jumps 
of ever-increasing size? We do not know. 

Table 10 shows the starting strings of lengths 2, 4, 6, 8 and 11 (when there are jumps in the 
maximum length) and the strings of record lengths 4, 8, 14, 66 and 123 that they produce. These 
five starting strings are unique. 

5.2. Curling number transforms of other sequences 

It is interesting to apply the curling number transform to other sequences, particularly those for 
which the definition involves properties of substrings. For example, the binary Thue-Morse sequence 

0110100110010110 
1001011001101001... 

(A10060 in [8]) has the property that it contains no cubes UUU as substrings (see [2], [5], [6], [8] 
for further information). Its curling number transform, which naturally contains only I's and 2's, 
is 

1112112212122122 
1222122222122122... 

(A93914). We leave it to the interested reader to investigate the properties of this sequences and 
of the other new sequences mentioned below. 

There are many examples of ternary sequences which contain no squares, and of course their 
curling number transforms are simply the all-ones sequence 1. However, the lexicographically 
earliest sequence from P* whose transform is 1 is the ruler sequence r (A1511) mentioned in Section 
4.4. 
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We give one further example. The Kolakoski sequence is a sequences of I's and 2's defined by 
K{1) = 1, K{n) = length of n-th run: 



1221121221221121 
1221211212211211... 

(A2 in [8]). This also contains no cubes. The transformed sequence is 

1112121122122221 
1222112212221211... 

(A93921). 

5.3. Generalizations 

In this final section we briefly mention a few of the possible generalizations of the sequence A. 

(i) The recurrence (2) may be replaced by a(l) = 1, a{n + 1) = /(C(a(l), . . . ,a(n))) for n > 1, for 
any suitable function /. For example, f{x) = fioor(x/2) produces 

0010011112001001 

1 1 1 2 1 1 1 1 1 1 2 ... 

(A91970), which presumably has an even slower rate of growth than A. 

(ii) a(l) = a(2) = 1, a(n + 2) = C(a(l), . . . , a(n)) for n > 1 produces 

1112311123122121 
1221211222234111... 

(A94006). This has the property that its curling number transform is the same sequence but shifted 
one place to the left. 

(iii) A greedy version 

of ^(2)^ ^(1) ^ 2. For n > I, let k = C{g{l), . . . , g{n)). U k > 1, 
g{n + 1) = k (as in A^"^^), but if A; = 1, choose g{n + 1) so that C{g{l), . . . ,g{n + 1)) is maximized. 
If there is more than one choice for g{n + 1), pick the smallest. The resulting sequence (A94321) is: 

2223322233222322 
2322233222322232... 



(iv) A two-dimensional version of A. Define t{i,j), i > 1, J > 1, as follows: t{i, 1) = t{l,i) = a{i). 
For i,j > 1, t{i + l,j + 1) = max{fci,fc2}, where ki = C{t{i + 1,1), t(i + 1,2),... ,t{i + l,j)), 
k2 = C(t(l,j + l),i(2,j + l),...,t{i,j + 1)) — see Table 11 (A94781). The first two rows (or 
columns) give A and the third row (or column) is A^'^\ 

(v) J. Taylor [9] has suggested two broad generalizations of the original recurrence. Let ^ be an 
equivalence relation on strings of integers of each fixed length. Write 

a(l)a(2) . . . a(n) = XY1Y2 . . . ^ , (50) 

where the Yi are nonempty strings with Yi ^2 ~ • • • ~ ^fc and k is maximal; then a{n + 1) = 
k. Choosing ~ to be the identity relation gives A. Taylor has contributed several interesting 
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generalizations of A to [8] obtained from other equivalence relations. For example, if two strings 
are equivalent if one is a permutation of the other, the resulting sequence is 

1121122231121122 
2322232223322411... 

(A91976), which agrees with A for the first 19 terms. But after 

1121122231121122232 

the next term is now 2, not 1, since we can take [7 = 11211, Yi = 222311 2, 12 = 112223 2, 
where I2 is a permutation of Yi. 

(vii) More generally, Taylor suggests using a partial order ~< on integer strings of all lengths (not 
just strings of the same length), and requiring the Yi in (50) to be nonempty and satisfy li ^ 1^2 ^ 
• • • -< Ifc where k is maximal. For further examples of Taylor's sequences the reader is referred to 
the entries A91975 and A92331-A92335 in [8]. 
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n 


Max. 


Ave. 


1 


1 


1 


2 


4 


2.75 


3 


5 


3.75 


4 


8 


5.125 


5 


9 


6.2187 


6 


14 


7.5 


7 


15 


8.5703 


8 


66 


10.2734 


9 


68 


11.3828 


10 


70 


12.5293 


11 


123 


13.6099 


12 


124 


14.6658 


13 


125 


15.6683 


14 


132 


16.6957 


15 


133 


17.7047 


16 


134 


18.7168 


17 


135 


19.7206 


18 


136 


20.7278 


19 


138 


21.7304 


20 


139 


22.7341 


21 


140 


23.7353 


22 


142 


24.7372 


23 


143 


25.7379 


24 


144 


26.7388 


25 


145 


27.7391 


26 


146 


28.7396 


27 


147 


29.7398 


28 


148 


30.74 


29 


149 


31.7401 


30 


150 


32.7402 



Table 9: Maximum and average length of string produced by any starting sequence of n 2's and 
3's, stopping when first 1 is reached. 
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n = 2 



Starting string 2 2 produces length 4: 
2 2 2 3 



n = 4 Starting string 2 3 2 3 produces length 8: 
23232223 

n = 6 Starting string 2 2 2 3 2 2 produces length 14: 
22232223222332 

n = 8 Starting string 23222323 produces length 66: 

232223232223222322322232223232 
223222322322232223232223222322 
3 2 2 3 3 2 

n = 11 Starting string 22323222322 produces length 123: 
223232223222322322232223232223 
222322322232223232223222322322 
232322232223223222322232322232 
223223222322232322232223223232223 

Table 10: Starting strings of n 2's and 3's which extend under the map C for a record number of 
steps before reaching a 1. Each entry also shows the final string (until just before the first 1 is 
reached) . 



1 


1 


2 


1 


1 


2 


2 


2 


CO 


1 


1 


2 


1 


1 


1 


1 


2 


1 


1 


2 


2 


2 


3 


1 


1 


2 


1 


1 


2 


2 


2 


3 


2 


2 


2 


3 


2 


2 


2 


3 


3 


2 


1 


1 


3 


1 


1 


3 


3 


2 


1 


1 


2 


1 


1 


2 


1 


1 


2 


1 


1 


2 


2 


2 


3 


1 


2 


1 


1 


2 


2 


2 


2 


3 


2 


1 


1 


2 


1 


2 


3 


2 


2 


3 



Table 11: A two-dimensional version of the sequence. 
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